Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 204254 |
| Missing cells | 196561 |
| Missing cells (%) | 6.9% |
| Duplicate rows | 267 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 21.8 MiB |
| Average record size in memory | 112.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 3 |
| Dataset has 267 (0.1%) duplicate rows | Duplicates |
COD_PRIORIDADE is highly correlated with AFLUENCIA | High correlation |
AFLUENCIA is highly correlated with COD_PRIORIDADE | High correlation |
COD_PERG is highly correlated with COD_VIA_VERDE | High correlation |
COD_VIA_VERDE is highly correlated with COD_PERG | High correlation |
QUADRO is highly correlated with COD_PERG and 1 other fields | High correlation |
COD_PERG is highly correlated with QUADRO and 1 other fields | High correlation |
HORA_ADMISSAO is highly correlated with AFLUENCIA | High correlation |
COD_VIA_VERDE is highly correlated with QUADRO and 2 other fields | High correlation |
AFLUENCIA is highly correlated with HORA_ADMISSAO | High correlation |
Internamento is highly correlated with COD_VIA_VERDE | High correlation |
COD_VIA_VERDE has 196505 (96.2%) missing values | Missing |
COD_PRIORIDADE is highly skewed (γ1 = 20.91813947) | Skewed |
Reproduction
| Analysis started | 2023-06-05 20:47:22.807351 |
|---|---|
| Analysis finished | 2023-06-05 20:47:56.368774 |
| Duration | 33.56 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
URG_EPISODIO
Real number (ℝ≥0)
| Distinct | 199485 |
|---|---|
| Distinct (%) | 97.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21556822.94 |
| Minimum | 21000001 |
|---|---|
| Maximum | 22167405 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 21000001 |
|---|---|
| 5-th percentile | 21016591.3 |
| Q1 | 21082665.75 |
| median | 21171830 |
| Q3 | 22078642.75 |
| 95-th percentile | 22148454.35 |
| Maximum | 22167405 |
| Range | 1167404 |
| Interquartile range (IQR) | 995977 |
Descriptive statistics
| Standard deviation | 499021.4174 |
|---|---|
| Coefficient of variation (CV) | 0.02314911705 |
| Kurtosis | -1.946416125 |
| Mean | 21556822.94 |
| Median Absolute Deviation (MAD) | 162038.5 |
| Skewness | 0.113569111 |
| Sum | 4.403067313 × 1012 |
| Variance | 2.49022375 × 1011 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 21049662 | 11 | < 0.1% |
| 21099307 | 8 | < 0.1% |
| 21050990 | 5 | < 0.1% |
| 21156264 | 5 | < 0.1% |
| 22020677 | 4 | < 0.1% |
| 21176272 | 4 | < 0.1% |
| 22011969 | 4 | < 0.1% |
| 22044775 | 4 | < 0.1% |
| 22090812 | 4 | < 0.1% |
| 21001618 | 4 | < 0.1% |
| Other values (199475) | 204201 |
| Value | Count | Frequency (%) |
| 21000001 | 1 | |
| 21000002 | 1 | |
| 21000004 | 1 | |
| 21000005 | 1 | |
| 21000007 | 1 | |
| 21000008 | 1 | |
| 21000009 | 1 | |
| 21000011 | 1 | |
| 21000012 | 1 | |
| 21000013 | 1 |
| Value | Count | Frequency (%) |
| 22167405 | 1 | |
| 22167401 | 1 | |
| 22167399 | 1 | |
| 22167378 | 1 | |
| 22167369 | 1 | |
| 22167365 | 1 | |
| 22167364 | 1 | |
| 22167359 | 1 | |
| 22167357 | 1 | |
| 22167345 | 1 |
| Distinct | 48 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.41482174 |
| Minimum | 1 |
|---|---|
| Maximum | 55 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 21 |
| median | 32 |
| Q3 | 44 |
| 95-th percentile | 48 |
| Maximum | 55 |
| Range | 54 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 12.72888935 |
|---|---|
| Coefficient of variation (CV) | 0.4051873813 |
| Kurtosis | -1.015125846 |
| Mean | 31.41482174 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.2869730189 |
| Sum | 6416603 |
| Variance | 162.0246242 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 44 | 42232 | |
| 16 | 20393 | |
| 37 | 20242 | |
| 21 | 18393 | 9.0% |
| 25 | 14483 | 7.1% |
| 27 | 11117 | 5.4% |
| 7 | 9371 | 4.6% |
| 32 | 7895 | 3.9% |
| 51 | 6164 | 3.0% |
| 47 | 4953 | 2.4% |
| Other values (38) | 49011 |
| Value | Count | Frequency (%) |
| 1 | 1043 | 0.5% |
| 2 | 101 | < 0.1% |
| 3 | 132 | 0.1% |
| 4 | 1 | < 0.1% |
| 7 | 9371 | |
| 8 | 1798 | 0.9% |
| 9 | 1176 | 0.6% |
| 10 | 783 | 0.4% |
| 11 | 19 | < 0.1% |
| 12 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 55 | 1429 | 0.7% |
| 54 | 357 | 0.2% |
| 51 | 6164 | 3.0% |
| 50 | 823 | 0.4% |
| 49 | 480 | 0.2% |
| 48 | 3924 | 1.9% |
| 47 | 4953 | 2.4% |
| 46 | 1614 | 0.8% |
| 45 | 1957 | 1.0% |
| 44 | 42232 |
| Distinct | 148 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.11532699 |
| Minimum | 2 |
|---|---|
| Maximum | 220 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 39 |
| Q1 | 43 |
| median | 52 |
| Q3 | 118 |
| 95-th percentile | 173 |
| Maximum | 220 |
| Range | 218 |
| Interquartile range (IQR) | 75 |
Descriptive statistics
| Standard deviation | 49.66943595 |
|---|---|
| Coefficient of variation (CV) | 0.6440929175 |
| Kurtosis | -0.2469383964 |
| Mean | 77.11532699 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 1.038331002 |
| Sum | 15751114 |
| Variance | 2467.052868 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 43 | 63434 | |
| 52 | 33832 | |
| 153 | 7981 | 3.9% |
| 49 | 7738 | 3.8% |
| 109 | 7178 | 3.5% |
| 45 | 6673 | 3.3% |
| 131 | 6220 | 3.0% |
| 140 | 5668 | 2.8% |
| 152 | 5564 | 2.7% |
| 55 | 4867 | 2.4% |
| Other values (138) | 55099 |
| Value | Count | Frequency (%) |
| 2 | 315 | 0.2% |
| 3 | 8 | < 0.1% |
| 4 | 437 | 0.2% |
| 5 | 528 | 0.3% |
| 6 | 46 | < 0.1% |
| 7 | 198 | 0.1% |
| 8 | 2868 | |
| 9 | 149 | 0.1% |
| 10 | 193 | 0.1% |
| 11 | 180 | 0.1% |
| Value | Count | Frequency (%) |
| 220 | 16 | < 0.1% |
| 219 | 117 | 0.1% |
| 218 | 87 | < 0.1% |
| 217 | 11 | < 0.1% |
| 214 | 213 | 0.1% |
| 212 | 1277 | |
| 210 | 278 | 0.1% |
| 209 | 15 | < 0.1% |
| 208 | 8 | < 0.1% |
| 206 | 533 |
| Distinct | 68474 |
|---|---|
| Distinct (%) | 33.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51410.18774 |
| Minimum | 0 |
|---|---|
| Maximum | 86398 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12410.3 |
| Q1 | 38061.25 |
| median | 52298.5 |
| Q3 | 66978 |
| 95-th percentile | 80715 |
| Maximum | 86398 |
| Range | 86398 |
| Interquartile range (IQR) | 28916.75 |
Descriptive statistics
| Standard deviation | 19645.07471 |
|---|---|
| Coefficient of variation (CV) | 0.3821241582 |
| Kurtosis | -0.2399878296 |
| Mean | 51410.18774 |
| Median Absolute Deviation (MAD) | 14423.5 |
| Skewness | -0.4203965073 |
| Sum | 1.050073649 × 1010 |
| Variance | 385928960.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 52046 | 15 | < 0.1% |
| 34818 | 15 | < 0.1% |
| 37978 | 15 | < 0.1% |
| 53629 | 14 | < 0.1% |
| 52108 | 14 | < 0.1% |
| 36229 | 13 | < 0.1% |
| 53767 | 13 | < 0.1% |
| 50550 | 13 | < 0.1% |
| 35355 | 13 | < 0.1% |
| 48633 | 13 | < 0.1% |
| Other values (68464) | 204116 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 2 | |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 2 | |
| 12 | 2 | |
| 13 | 3 |
| Value | Count | Frequency (%) |
| 86398 | 4 | |
| 86397 | 2 | |
| 86396 | 2 | |
| 86395 | 1 | < 0.1% |
| 86394 | 1 | < 0.1% |
| 86393 | 3 | |
| 86392 | 1 | < 0.1% |
| 86391 | 1 | < 0.1% |
| 86389 | 1 | < 0.1% |
| 86388 | 4 |
COD_CAUSA
Real number (ℝ≥0)
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.315501895 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 12 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.125882227 |
|---|---|
| Coefficient of variation (CV) | 0.6532944326 |
| Kurtosis | 145.1326072 |
| Mean | 6.315501895 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.036385586 |
| Sum | 1289916 |
| Variance | 17.02290415 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 152096 | |
| 10 | 23323 | 11.4% |
| 12 | 7813 | 3.8% |
| 2 | 5925 | 2.9% |
| 9 | 4332 | 2.1% |
| 1 | 2615 | 1.3% |
| 4 | 1280 | 0.6% |
| 26 | 1082 | 0.5% |
| 22 | 1069 | 0.5% |
| 19 | 1066 | 0.5% |
| Other values (15) | 3645 | 1.8% |
| Value | Count | Frequency (%) |
| 1 | 2615 | 1.3% |
| 2 | 5925 | 2.9% |
| 3 | 530 | 0.3% |
| 4 | 1280 | 0.6% |
| 5 | 152096 | |
| 6 | 56 | < 0.1% |
| 9 | 4332 | 2.1% |
| 10 | 23323 | 11.4% |
| 11 | 2 | < 0.1% |
| 12 | 7813 | 3.8% |
| Value | Count | Frequency (%) |
| 99 | 113 | 0.1% |
| 30 | 3 | < 0.1% |
| 28 | 223 | 0.1% |
| 26 | 1082 | |
| 25 | 393 | 0.2% |
| 24 | 1 | < 0.1% |
| 22 | 1069 | |
| 21 | 433 | |
| 20 | 608 | |
| 19 | 1066 |
COD_PROVENIENCIA
Real number (ℝ≥0)
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.34530239 |
| Minimum | 1 |
|---|---|
| Maximum | 33 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 5 |
| median | 5 |
| Q3 | 33 |
| 95-th percentile | 33 |
| Maximum | 33 |
| Range | 32 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 13.1043158 |
|---|---|
| Coefficient of variation (CV) | 0.8539626958 |
| Kurtosis | -1.660751777 |
| Mean | 15.34530239 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.5546154045 |
| Sum | 3133833 |
| Variance | 171.7230926 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 109480 | |
| 33 | 64983 | |
| 7 | 11926 | 5.8% |
| 30 | 8971 | 4.4% |
| 9 | 6127 | 3.0% |
| 15 | 2202 | 1.1% |
| 2 | 499 | 0.2% |
| 3 | 16 | < 0.1% |
| 1 | 6 | < 0.1% |
| 10 | 4 | < 0.1% |
| Other values (6) | 7 | < 0.1% |
| (Missing) | 33 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 6 | < 0.1% |
| 2 | 499 | 0.2% |
| 3 | 16 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 109480 | |
| 6 | 1 | < 0.1% |
| 7 | 11926 | 5.8% |
| 9 | 6127 | 3.0% |
| 10 | 4 | < 0.1% |
| 15 | 2202 | 1.1% |
| Value | Count | Frequency (%) |
| 33 | 64983 | |
| 32 | 1 | < 0.1% |
| 30 | 8971 | 4.4% |
| 28 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 2202 | 1.1% |
| 10 | 4 | < 0.1% |
| 9 | 6127 | 3.0% |
| 7 | 11926 | 5.8% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 15 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.279163137 |
| Minimum | 1 |
|---|---|
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 4 |
| Maximum | 98 |
| Range | 97 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 4.402355997 |
|---|---|
| Coefficient of variation (CV) | 1.34252424 |
| Kurtosis | 447.1704827 |
| Mean | 3.279163137 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.91813947 |
| Sum | 669733 |
| Variance | 19.38073833 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 109930 | |
| 4 | 53557 | |
| 2 | 37458 | 18.3% |
| 5 | 1473 | 0.7% |
| 1 | 1392 | 0.7% |
| 98 | 429 | 0.2% |
| (Missing) | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1392 | 0.7% |
| 2 | 37458 | 18.3% |
| 3 | 109930 | |
| 4 | 53557 | |
| 5 | 1473 | 0.7% |
| 98 | 429 | 0.2% |
| Value | Count | Frequency (%) |
| 98 | 429 | 0.2% |
| 5 | 1473 | 0.7% |
| 4 | 53557 | |
| 3 | 109930 | |
| 2 | 37458 | 18.3% |
| 1 | 1392 | 0.7% |
IDADE
Real number (ℝ≥0)
| Distinct | 106 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.15790633 |
| Minimum | 0 |
|---|---|
| Maximum | 109 |
| Zeros | 249 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 14 |
| Q1 | 35 |
| median | 53 |
| Q3 | 70 |
| 95-th percentile | 87 |
| Maximum | 109 |
| Range | 109 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 22.84906763 |
|---|---|
| Coefficient of variation (CV) | 0.4380748621 |
| Kurtosis | -0.855062268 |
| Mean | 52.15790633 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | -0.1693593515 |
| Sum | 10653461 |
| Variance | 522.0798913 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 56 | 3364 | 1.6% |
| 55 | 3363 | 1.6% |
| 49 | 3313 | 1.6% |
| 50 | 3293 | 1.6% |
| 51 | 3284 | 1.6% |
| 54 | 3267 | 1.6% |
| 57 | 3253 | 1.6% |
| 60 | 3225 | 1.6% |
| 58 | 3213 | 1.6% |
| 52 | 3195 | 1.6% |
| Other values (96) | 171484 |
| Value | Count | Frequency (%) |
| 0 | 249 | 0.1% |
| 1 | 793 | |
| 2 | 746 | |
| 3 | 616 | |
| 4 | 475 | |
| 5 | 457 | |
| 6 | 549 | |
| 7 | 601 | |
| 8 | 631 | |
| 9 | 757 |
| Value | Count | Frequency (%) |
| 109 | 3 | < 0.1% |
| 104 | 5 | < 0.1% |
| 103 | 7 | < 0.1% |
| 102 | 6 | < 0.1% |
| 101 | 12 | < 0.1% |
| 100 | 58 | < 0.1% |
| 99 | 71 | < 0.1% |
| 98 | 150 | |
| 97 | 169 | |
| 96 | 266 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 196505 |
| Missing (%) | 96.2% |
| Memory size | 1.6 MiB |
| 2.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 5187 | 2.5% |
| 1.0 | 2562 | 1.3% |
| (Missing) | 196505 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2.0 | 5187 | |
| 1.0 | 2562 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
SEXO
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 2 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 105905 | |
| 1 | 98349 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2 | 105905 | |
| 1 | 98349 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
COD_CONCELHO
Real number (ℝ≥0)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.692182283 |
| Minimum | 1 |
|---|---|
| Maximum | 24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 9 |
| Q3 | 10 |
| 95-th percentile | 11 |
| Maximum | 24 |
| Range | 23 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.390017096 |
|---|---|
| Coefficient of variation (CV) | 0.4407094075 |
| Kurtosis | -0.6025782737 |
| Mean | 7.692182283 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.4355549112 |
| Sum | 1571159 |
| Variance | 11.49221591 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 44731 | |
| 11 | 39305 | |
| 5 | 26689 | |
| 9 | 24594 | |
| 7 | 18453 | |
| 1 | 14023 | 6.9% |
| 3 | 11796 | 5.8% |
| 4 | 6860 | 3.4% |
| 2 | 5798 | 2.8% |
| 6 | 5619 | 2.8% |
| Other values (14) | 6386 | 3.1% |
| Value | Count | Frequency (%) |
| 1 | 14023 | 6.9% |
| 2 | 5798 | 2.8% |
| 3 | 11796 | 5.8% |
| 4 | 6860 | 3.4% |
| 5 | 26689 | |
| 6 | 5619 | 2.8% |
| 7 | 18453 | |
| 8 | 724 | 0.4% |
| 9 | 24594 | |
| 10 | 44731 |
| Value | Count | Frequency (%) |
| 24 | 2 | < 0.1% |
| 23 | 36 | < 0.1% |
| 22 | 3 | < 0.1% |
| 21 | 22 | < 0.1% |
| 20 | 10 | < 0.1% |
| 19 | 25 | < 0.1% |
| 18 | 86 | < 0.1% |
| 17 | 613 | |
| 16 | 148 | 0.1% |
| 15 | 1149 |
| Distinct | 158 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.3439443 |
| Minimum | 0 |
|---|---|
| Maximum | 164 |
| Zeros | 1785 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 27 |
| median | 51 |
| Q3 | 69 |
| 95-th percentile | 92 |
| Maximum | 164 |
| Range | 164 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 26.47840032 |
|---|---|
| Coefficient of variation (CV) | 0.5366089132 |
| Kurtosis | -0.6656648834 |
| Mean | 49.3439443 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.1147599557 |
| Sum | 10078698 |
| Variance | 701.1056838 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 58 | 2943 | 1.4% |
| 62 | 2931 | 1.4% |
| 51 | 2907 | 1.4% |
| 63 | 2901 | 1.4% |
| 64 | 2891 | 1.4% |
| 61 | 2849 | 1.4% |
| 66 | 2847 | 1.4% |
| 52 | 2844 | 1.4% |
| 57 | 2839 | 1.4% |
| 54 | 2834 | 1.4% |
| Other values (148) | 175468 |
| Value | Count | Frequency (%) |
| 0 | 1785 | |
| 1 | 953 | |
| 2 | 828 | |
| 3 | 850 | |
| 4 | 915 | |
| 5 | 1060 | |
| 6 | 1135 | |
| 7 | 1311 | |
| 8 | 1524 | |
| 9 | 1654 |
| Value | Count | Frequency (%) |
| 164 | 2 | |
| 163 | 3 | |
| 162 | 1 | < 0.1% |
| 157 | 1 | < 0.1% |
| 155 | 1 | < 0.1% |
| 154 | 1 | < 0.1% |
| 153 | 1 | < 0.1% |
| 151 | 3 | |
| 150 | 1 | < 0.1% |
| 149 | 1 | < 0.1% |
LOS
Real number (ℝ≥0)
| Distinct | 57579 |
|---|---|
| Distinct (%) | 28.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22178.23181 |
| Minimum | 348 |
|---|---|
| Maximum | 2851218 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 348 |
|---|---|
| 5-th percentile | 3083 |
| Q1 | 7807 |
| median | 16003 |
| Q3 | 28184 |
| 95-th percentile | 66080.4 |
| Maximum | 2851218 |
| Range | 2850870 |
| Interquartile range (IQR) | 20377 |
Descriptive statistics
| Standard deviation | 22755.31125 |
|---|---|
| Coefficient of variation (CV) | 1.026020083 |
| Kurtosis | 1181.498907 |
| Mean | 22178.23181 |
| Median Absolute Deviation (MAD) | 9364 |
| Skewness | 11.91213607 |
| Sum | 4529992561 |
| Variance | 517804190.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4212 | 22 | < 0.1% |
| 6630 | 20 | < 0.1% |
| 5188 | 20 | < 0.1% |
| 5413 | 19 | < 0.1% |
| 4562 | 19 | < 0.1% |
| 5685 | 19 | < 0.1% |
| 3567 | 19 | < 0.1% |
| 8159 | 19 | < 0.1% |
| 4951 | 19 | < 0.1% |
| 6902 | 19 | < 0.1% |
| Other values (57569) | 204059 |
| Value | Count | Frequency (%) |
| 348 | 1 | |
| 374 | 1 | |
| 378 | 1 | |
| 383 | 1 | |
| 387 | 1 | |
| 407 | 1 | |
| 408 | 1 | |
| 418 | 1 | |
| 468 | 1 | |
| 512 | 1 |
| Value | Count | Frequency (%) |
| 2851218 | 1 | |
| 379382 | 1 | |
| 366532 | 1 | |
| 348103 | 1 | |
| 347335 | 1 | |
| 345508 | 1 | |
| 324660 | 1 | |
| 324160 | 1 | |
| 322602 | 1 | |
| 320565 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 183867 | |
| 1 | 20387 | 10.0% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 183867 | |
| 1 | 20387 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| URG_EPISODIO | QUADRO | COD_PERG | HORA_ADMISSAO | COD_CAUSA | COD_PROVENIENCIA | COD_PRIORIDADE | IDADE | COD_VIA_VERDE | SEXO | COD_CONCELHO | AFLUENCIA | LOS | Internamento | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 21000001 | 25 | 39 | 984 | 5.0 | NaN | 2.0 | 58 | NaN | 1 | 11 | 0 | 52236.0 | 0 |
| 1 | 21000002 | 37 | 109 | 1100 | 5.0 | 33.0 | 3.0 | 87 | NaN | 2 | 6 | 1 | 15460.0 | 0 |
| 2 | 21000004 | 32 | 131 | 1563 | 12.0 | 5.0 | 3.0 | 12 | NaN | 1 | 11 | 2 | 7737.0 | 0 |
| 3 | 21000005 | 37 | 144 | 1642 | 5.0 | 33.0 | 2.0 | 74 | NaN | 2 | 1 | 1 | 85298.0 | 0 |
| 4 | 21000007 | 7 | 43 | 2961 | 10.0 | 30.0 | 3.0 | 66 | NaN | 1 | 11 | 3 | 19719.0 | 0 |
| 5 | 21000008 | 44 | 43 | 3104 | 10.0 | 5.0 | 3.0 | 40 | NaN | 2 | 6 | 4 | 6196.0 | 0 |
| 6 | 21000009 | 21 | 68 | 3313 | 5.0 | 30.0 | 2.0 | 24 | NaN | 2 | 10 | 1 | 2207.0 | 0 |
| 7 | 21000011 | 47 | 38 | 4623 | 5.0 | 5.0 | 4.0 | 19 | NaN | 2 | 9 | 6 | 4677.0 | 0 |
| 8 | 21000012 | 43 | 131 | 4838 | 10.0 | 30.0 | 3.0 | 3 | NaN | 1 | 6 | 6 | 1642.0 | 0 |
| 9 | 21000013 | 16 | 153 | 7160 | 5.0 | 5.0 | 2.0 | 69 | NaN | 1 | 3 | 1 | 43060.0 | 0 |
Last rows
| URG_EPISODIO | QUADRO | COD_PERG | HORA_ADMISSAO | COD_CAUSA | COD_PROVENIENCIA | COD_PRIORIDADE | IDADE | COD_VIA_VERDE | SEXO | COD_CONCELHO | AFLUENCIA | LOS | Internamento | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 204244 | 22167345 | 44 | 43 | 45998 | 9.0 | 5.0 | 3.0 | 16 | NaN | 1 | 9 | 26 | 4282.0 | 0 |
| 204245 | 22167357 | 44 | 43 | 47611 | 9.0 | 5.0 | 3.0 | 10 | NaN | 2 | 1 | 19 | 3929.0 | 0 |
| 204246 | 22167359 | 21 | 52 | 47827 | 5.0 | 5.0 | 4.0 | 56 | NaN | 1 | 10 | 23 | 6053.0 | 0 |
| 204247 | 22167364 | 44 | 43 | 48281 | 10.0 | 5.0 | 3.0 | 69 | NaN | 2 | 5 | 20 | 4639.0 | 0 |
| 204248 | 22167365 | 49 | 49 | 48411 | 14.0 | 5.0 | 2.0 | 22 | NaN | 2 | 10 | 1 | 6429.0 | 0 |
| 204249 | 22167369 | 44 | 55 | 48837 | 5.0 | 5.0 | 4.0 | 28 | NaN | 1 | 5 | 23 | 5403.0 | 0 |
| 204250 | 22167378 | 37 | 52 | 50086 | 5.0 | 5.0 | 4.0 | 23 | NaN | 2 | 5 | 19 | 2234.0 | 0 |
| 204251 | 22167399 | 55 | 140 | 51985 | 5.0 | 33.0 | 4.0 | 23 | NaN | 2 | 6 | 16 | 3695.0 | 0 |
| 204252 | 22167401 | 44 | 43 | 52041 | 12.0 | 5.0 | 3.0 | 23 | NaN | 1 | 10 | 12 | 2919.0 | 0 |
| 204253 | 22167405 | 39 | 43 | 52250 | 1.0 | 33.0 | 3.0 | 47 | NaN | 2 | 5 | 13 | 2530.0 | 0 |
Most frequently occurring
| URG_EPISODIO | QUADRO | COD_PERG | HORA_ADMISSAO | COD_CAUSA | COD_PROVENIENCIA | COD_PRIORIDADE | IDADE | COD_VIA_VERDE | SEXO | COD_CONCELHO | AFLUENCIA | LOS | Internamento | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 45 | 21050990 | 37 | 184 | 29893 | 5.0 | 33.0 | 2.0 | 81 | 1.0 | 1 | 3 | 11 | 11807.0 | 0 | 5 |
| 101 | 21125630 | 27 | 46 | 64985 | 5.0 | 7.0 | 2.0 | 65 | 2.0 | 2 | 11 | 21 | 17755.0 | 0 | 3 |
| 125 | 21152687 | 27 | 46 | 77098 | 5.0 | 33.0 | 2.0 | 59 | 2.0 | 1 | 15 | 7 | 17402.0 | 0 | 3 |
| 166 | 22006087 | 27 | 46 | 52920 | 5.0 | 5.0 | 2.0 | 59 | 2.0 | 2 | 1 | 17 | 32100.0 | 1 | 3 |
| 258 | 22156127 | 27 | 46 | 55152 | 5.0 | 5.0 | 2.0 | 72 | 2.0 | 2 | 1 | 18 | 181548.0 | 0 | 3 |
| 0 | 21002639 | 27 | 46 | 48039 | 5.0 | 33.0 | 2.0 | 47 | 2.0 | 1 | 10 | 16 | 79761.0 | 0 | 2 |
| 1 | 21005100 | 37 | 8 | 68589 | 5.0 | 33.0 | 2.0 | 77 | 1.0 | 2 | 5 | 16 | 15831.0 | 0 | 2 |
| 2 | 21006553 | 37 | 184 | 55400 | 5.0 | 33.0 | 2.0 | 60 | 1.0 | 1 | 11 | 24 | 20140.0 | 1 | 2 |
| 3 | 21006785 | 27 | 46 | 37116 | 5.0 | 2.0 | 2.0 | 63 | 2.0 | 1 | 3 | 7 | 28764.0 | 1 | 2 |
| 4 | 21008154 | 27 | 46 | 18799 | 5.0 | 33.0 | 2.0 | 92 | 2.0 | 2 | 7 | 20 | 40781.0 | 0 | 2 |